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Abstract 

The ability to target DNA metliylation toward a single, user-designated CpG site in vivo may have wide applicability for basic 
biological and biomedical research. A tool for targeting methylation toward single sites could be used to study the effects of 
individual methylation events on transcription, protein recruitment to DNA, and the dynamics of such epigenetic 
alterations. Although various tools for directing methylation to promoters exist, none offers the ability to localize 
methylation solely to a single CpG site. In our ongoing research to create such a tool, we have pursued a strategy employing 
artificially bifurcated DNA methyltransferases; each methyltransferase fragment is fused to zinc finger proteins with affinity 
for sequences flanking a targeted CpG site for methylation. We sought to improve the targeting of these enzymes by 
reducing the methyltransferase activity at non-targeted sites while maintaining high levels of activity at a targeted site. Here 
we demonstrate an in vitro directed evolution selection strategy to improve methyltransferase specificity and use it to 
optimize an engineered zinc finger methyltransferase derived from M.Sssl. The unusual restriction enzyme McrBC is a key 
component of this strategy and is used to select against methyltransferases that methylate multiple sites on a plasmid. This 
strategy allowed us to quickly identify mutants with high levels of methylation at the target site (up to ~80%) and nearly 
unobservable levels of methylation at a off-target sites (<1%), as assessed in £ coli. We also demonstrate that replacing the 
zinc finger domains with new zinc fingers redirects the methylation to a new target CpG site flanked by the corresponding 
zinc finger binding sequences. 
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Introduction 

CpG methylation is one of the most extensively studied 
epigenetic modifications; it broadly regulates and maintains 
transcriptional repression. CpG methylation is involved in proper 
ceUular differentiation, heterochromatin formation and in main- 
taining chromosomal stability [1]. Further, aberrant methylation 
patterns cause or are observed in numerous diseases. Imprinting 
defects lead to disorders such as Prader-Willi and Angelman 
syndromes [2]. Notably, global genomic hypomethylation and 
local hypermethylation of CpG islands (CGIs) commonly occur in 
cancer [3]. Though much has been learned about how methyl- 
ation patterns are established and erased, the causes of aberrant 
methylation and the reestablishment of methylation patterns 
during development remain active areas of research. To study 
the effects and dynamics of DNA methylation, it would be 
generally useful to target methylation toward specific, user-defined 
sequences. 

Several groups have engineered methyltransferases that direct 
methylation towards user-defined DNA sequences. The general 
strategy, pioneered by Xu and Bestor, involves fusion of a 
sequence specific DNA binding protein to a methyltransferase 



enzyme [4]. These constructs have been used to affect methyla- 
tion, in vitro, in E. coli, and in cancer cell lines [5-10]. These 
directed methyltransferases have been shown to stably and 
heritably reduce the expression of Sox2 and Maspin genes [11]. 
Siddique et al. demonstrated that targeting methylation towards 
the VEGF-A promoter significantly reduced gene expression in 
SKOV3 cells [12]. A recent review summarizes much of the 
literature on targeted methylation [13]. However, the engineered 
enzymes mentioned above methylate multiple CpG sites adjacent 
to the targeted DNA sequence. Despite the successes of these 
studies in biasing methylation to a particular region, only a few 
studies have focused on targeting methylation to single CpG sites 
[14-17]. 

Though methylation at single sites in eukaryotes is not believed 
to be the main means of epigenetic transcriptional silencing, 
multiple studies suggest single methylation events can alter the 
expression levels of select genes. In vitro methylation of a single 
CpG site within the S1000A2 promoter on a reporter plasmid 
resulted in significant downregulation of gene expression, upon 
transfection, relative to an unmethylated, transfected control [18]. 
Methylated oligonucleotides targeting an intronic region of 
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pmxisomal membrane protein 4 {PXMP4 orPMP24) resulted in a single 
methylation mark on chromosomal DNA that downregulated gene 
expression relative to controls; this result corroborated diflFerences 
observed between normal tissues and tumor cells [19]. Electro- 
mobility shift assays show that methylation at a single site impairs 
the binding of the genomic insulator CTCF [20] . 

In addition to studying effects on transcription, an engineered 
methyltransferase that specifically methylates a single site in a 
promoter would be generally useful for studying the effects of 
single aberrant methylation events on the propagation, mainte- 
nance, and correction of epigenetic marks. Finally, methyltrans- 
ferases were recendy engineered to more efficiently incorporate 
the transfer of unnatural aJkyl groups donated by S-adenosybne- 
thionine cofactor analogues [21]. This may make it possible to use 
targeted methyltransferases to site-specifically label DNA. 

We d(;sire a tool to assess the efferts of single methylation c'\'ents 
within the chromosome of human cell lines. As a first step, we 
describe significant progress made toward designing enzymes that 
target methylation at single CpG sites flanked by user-defined 
sequences in E. coli. The lack of endogenous CpG methyltrans- 
ferases in E. coli facilitates the assessment target and off-target 
activities in vivo. 

Our strategy for achieving single-site, targeted methylation is to 
make the assembly of a heterodimeric methyltransferase depen- 
dent on specific DNA sequences flanking a site to be methylated. 
To accomplish this task, we have employed naturally [16] or 
artificially split [17] DNA methyltransferases and altered these 
heterodimers to reduce their innate ability to r(;assi'mble into a 
functional enzyme. Reducing the ability of the fragments to self- 
assemble is necessary as we and others have shown that bifurcated 
methyltransferases are capable of unassisted reassembly into 
functional enzymes [22-25]. These reassembly-defective frag- 
ments are fused to zinc fingers, whose recognition sequences flank 
the targeted CpG site. The zinc finger domains bind to DNA, 
increasing the local concentration of the fused methyltransferase 
fragments over a targeted CpG site. Proper orientation of the 
methyltransferase fragment-zinc finger fusions at the target site 
primes the fragments for reassembly into a functional enzyme. The 
orientation of the fragments at the target site is affected by the 
topology of the fusions and the amino acid linker lengths 
connecting protein domains [17]. Optimization of these param- 
eters, as well as the reduction of the affinity of fragments for each 
other and for DNA, reduces the enzyme's non-specific activity and 
promotes enzymatic reassembly at the targeted CpG site [15-17]. 

We have previously demonstrated that, when fused to zinc 
finger proteins, a split version of M.SssI will bias methylation 
toward a targeted CpG site flanked by two cognate zinc finger 
binding sequences [17]. The DNA and amino acid sequence of 
this engineered protein is provided as Figure SI. The bifurcation 
point in M.SssI was chosen based on a CLUSTALW alignment to 
a site in a similarly engineered M.Hhal enzyme [17]. Monomeric 
M.SssI naturally methylates CpG sites [26]. Although the 
bifurcated M.SssI construct methylated the target site, it also 
methylated other M.SssI sites [17]. Site specific mutations Q_147L 
or S317A in the M.SssI domain, introduced to reduce the 
enzyme's DNA binding affinity and activity, reduced unwanted 
methylation at these other CpG sites [17]. We sought to reduce 
off-target methylation without affecting levels of methylation at the 
targeted site. Here we present a selection strategy to improve the 
targeting of methyltransferases toward new sites and have used this 
strategy to optimize our M.SssI fusion construct. We performed a 
negative selection against off-target methylation and a positive 
selection for methylation at a target site in vitro. This strategy 
allowed us to quickly identify variants with improved targeting 



ability and activity in vivo. We also demonstrate the modularity of 
our constructs by altering the zinc finger domains to redirect 
methylation toward a new target site. 

Materials and Methods 

Enzymes, Oligonucleotides and Bacterial Strains 

Restriction enzymes, T4 figase, T4 kinase, and Phusion High 

Fidelity PCR MMX were purchased from New England Biolabs 
(Ipswich, MA). Boxl was purchased from ThermoFisher Scientific 
(Waltham, MA). Platinum Pfx DNA polymerase was purchased 
from Life Technologies (Carlsbad, CA). PfuTurbo Cx Hotstart 
DNA polymerase was purchase from Agilent Technologies (Santa 
Clara, CA). Plasmid-Safe-ATP-dependent DNAse was purchased 
from Epicentre (Madison, WI). pDIMNS and pAR plasmids have 
been previously described [16,17]. All oligonucleotides and 
gBlocks were synthesized by Invitrogen (Carlsbad, CA) or 
Integrated DNA Technologies (CoralviUe, lA). Gel electrophoresis 
and PCR were preformed essentially as previously described [27]. 
Plasmids were isolated using QIAprep Spin Miniprep Kit (Qiagen, 
Valencia, CA). DNA fragments were purified from agarose gels 
using QIAquick Gel Extraction Kit (Qiagen, Valencia, CA) or 
PureLink Quick Gel Extraction Kit (Invitrogen, Carlsbad,CA, 
USA) and further concentrated using DNA Clean & Concentra- 
tor-5 (Zymo Research, Irvine, CA). 

Escherichia coh K-12 strain ER2267 [F proA^ff' lacP 
D(lac^ML5 zzf::mini-TnlO (Kan^)/D(argF-lac^U169 glnV44 
el4'(McrA") rJbDl? recAl relAl? endAl spoTl? thi-1 D(mcrC~ 
mrr)114::IS10] was acquired from New England Biolabs (Ipswich, 
MA) and was used in selections, methylation assays and cloning. 
NEB 5-alpha Competent E. coli (High Efficiency) [fhitA2D(argF- 
lac^U169 phoA glnV44 (p80A(lacZlM15 gyrA96 recAl relAl endAl thi- 
1 hsdR17] were also used for cloning and were purchased from 
New England Biolabs (Ipswich, MA). 

Plasmld Creation 

pDIMNS was used for library creation and testing of library 
variants [17]. pDIMN9 was constructed as follows for use in 
golden gate cloning. Plasmid pDIMN8 was altered by sUentiy 
mutating a Bsal site in the Amp^ gene via pFunkel mutagenesis 
[28]. PCR, digestion and cloning removed a Bbsl restriction site to 
create vector pDIMN9. Golden gate cloning was used to fuse new 
zinc finger proteins to methyltransferase fragments. For the 
creation of plasmids used in golden gate cloning, regions encoding 
zinc finger proteins were replaced with Bbsl sites. pDIMN9 
contained a M.SssI [l-272]-BbsI construct for the addition of zinc 
fingers to the N-terminal fragment. pAR contained BbsI-M.SssI 
[273-386] construct for the addition of new zinc fingers to the C- 
terminal fragments [16]. gBlocks encoding zinc fingers and Bbsl 
sites were purchased from Integrated DNA Technologies. Golden 
gate cloning to fuse zinc finger-encoding gBlocks to the above 
plasmids was performed essentially as described [29]. Zinc finger 
CD54a was designed using the zinc finger tools website and 
previously identified zinc finger domains [30-32]. As previously 
described, plasmids containing genes encoding individual C- 
terminal and N-terminal zinc finger-fused proteins were digested 
with EcoRI and Spel and ligated together, in order to place these 
genes on a single large plasmid for characterization in E. coli [16]. 
Site 1 and site 2 on this plasmid refer to previously described 
cloning sites on this large plasmid [17] and were used to construct 
the various target and non-target sites described in the study. 
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Figure 1. Schematics of the vector, library, proteins, and selection used in these experiments. (A) The vector used in selections. The 
vector encodes for both heterodimeric fragments fused to zinc fingers under the control of separate inducible arabinose (pBAD) and IPTG (lac) 
promoters, a target site, and the araC gene. (B) A schema of the zinc finger-fused, bifurcated IVI.SssI and the mutagenized codons used in library 
construction. Codons corresponding to residues 297-301 of IVI.SssI (located in the C-terminal fragment) were randomized. Numbering scheme is that 
of the wildtype M.Sssl. (C) An assembled zinc finger-fused heterodimeric IVI.SssI methyltransferase assembled at the target site. The target site consists 
of an internal CpG site nested vj/ithin an Fspl restriction site, and flanked by HS1 and HS2 recognition sequences. (D) The non-target site used to 
assess off-target methylation. The non-target site lacks HSl and HS2 zinc finger binding sites, but contains a CpG site nested within an SnaBI 
restriction site (E) An overview of the selections used in this experiment. The schematic illustrates the fates of plasmids encoding inactive 
methyltransferase (digested by Fspl, left), a desired targeting methyltransferase methylated at the target site (not digested, middle) and a nonspecific 
methyltransferase methylating multiple M.Sssl (i.e CpG) sites (digested by McrBC, right). 
doi:1 0.1 371 /journal.pone.0096931 .gOOl 



Construction of Cassette Mutagenesis Library 

An NNK cassette mutagenesis library of M.Sssl [273-386] was 
constructed by overlap extension PGR. PGR was carried out using 
an oligonucleotide degenerate for a five amino acid region in the 
G-terminal fragment corresponding to amino acids 297-301 in the 
wild type enzyme. Fragments were digested with Agel-HF and 
Spel and ligated into pDIMNS containing HS2 and the complete 
N-terminal fragment-HSl fusion (Fig. 1 A,B). HSl and HS2 have 
been described previously [33]. Site 1 contained a target site 
comprised of GpG site nested within an Fspl restriction site and 
flanked by HSl and HS2 zinc finger recognition sequences 
(Fig. IG). The plasmid also possessed a non-target site that lacked 
zinc fmger binding sites but contained an internal SnaBI 
restriction site (Fig. ID). Ligations were transformed into 
ER2267 electrocompetent cells, which were plated onto agarose 
plates containing 100 Hg/ml ampiciUin and 2% w/v glucose. 



Plates were incubated overnight at 37°G. The naive library 
contained 2x10' transformants. 

Library Selection 

Plated library variants were recovered from the plate in 
lysogeny broth supplemented with 15% v/v glycerol and 2% w/ 
v glucose and stored at — 80°G. Aliquots were thawed and used to 
inoculate 10 ml of lysogeny broth supplemented with 100 Hg/ml 
ampiciUin salt, 0.2% w/v glucose, 1 mM IPTG, and 0.0167% w/v 
arabinose. These cultures were incubated overnight at 37°G and 
250 rpm. Plasmid DNA was isolated via QIAprep Spin Miniprep 
Kit and digested for 3 hours at 37°G with McrBG (10 units/|a,g 
DNA), Fspl (2.5-5 units/ng DNA) in IX NEBuffer 2 supple- 
mented with 100 |J,g/ml BSA and 1 mM GTP. Reactions were 
halted by incubation at 65°G for over 20 min to which ExoIII (30 
units/|a,g DNA) was added and the solution incubated at 37°G for 
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Figure 2. Methylation assay for selected variants. (A) Relative locations of the target site and non-target site on a plasmid linearized by Ncol 
digestion. For sequences of the target and non-target sites, refer to Figures 1C and D (B) The restriction endonuclease protection assay for 
methylation at the target and non-target site uses digestion w\th Ncol to linearize the plasmid and either Fspl or SnaBI to assess the target and off- 
target methylation, respectively. Fspl and SnaBI cannot digest a methylated site. Shown are results from select variants as well as the 'wildtype' 
heterodimeric enzyme (i.e. no mutations to residues 297-301) with or without a catalytically inactivating (C141S), or a catalytically compromised 
(Q147L) mutation. 

doi:1 0.1 371 /journal.pone.0096931 .g002 



60 min. ExoIII digestion was halted by incubation at 80°C for 
over 30 min and the DNA was desalted using Zymo Clean and 
Concentrator-5 kits per manufacturer's instructions. DNA was 
transformed into ER2267 electrocompetent cells and plated on 
agar supplemented with 2% w/v glucose and 100 |J.g/ml 
ampicHlin salt. 

Cells were recovered from the plate as before and plasmid DNA 
was isolated using the QIAprep Spin Miniprep Kit. The DNA was 
digested with Fspl (2-2.8 units/|^g DNA) in IX NEBuflFer 4 and 
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Figure 3. Sequence conservation at residues 297-301 of all 
catalytically active selected variants. (A) The wild type sequence 
for residues 297-301 (B) A sequence logo of active variants. 
doi:1 0.1 371 /journal.pone.0096931 .g003 



linear DNA was isolated via gel electrophoresis. PGR was used to 
amplify the portion of the linear plasmid containing genes 
encoding for the N-terminal and C-terminal fragments fused to 
zinc fingers. Purified PGR products were subcloned into the 
selection plasmid for an additional round of selection. 

Restriction Endonuclease Protection Assays 

Cultures from colonies were incubated overnight at 37°C and 
250 rpm in lysogeny broth supplemented with 0.2% w/v glucose 
and 100 |J,g/ml ampiciUin salt and stored as glycerol stocks. 
Glycerol stocks were used to inoculate 10 ml of lysogeny broth 
supplemented with 100 Hg/ml ampiciUin salt, 0.2% w/v glucose, 
1 mM IPTG, and 0.0167% w/v arabinose. After growth 
overnight at 37°G and 250 rpm, plasmid DNA was purified from 
the cultures with a QIAprep Spin Miniprep Kit. Plasmid DNA 
(500 ng) was digested with NcoI-HF (10 units) and either Fspl (2.5 
units) or SnaBI (2.5 units) in IX NEBuffer 4 for over one hour at 
37°G. SnaBI digests were supplemented with 100 Hg/ml BSA. 
Half of each digested sample was loaded onto agarose gels (1.2% 
w/v in TAE) and electrophoresed at 90 V for 105-120 minutes. 

Bisulfite Analysis 

Glycerol stocks of ER2267 cells containing a plasmid encoding 
the methyltransferase variants were used to inoculate 10 ml of 
lysogeny broth supplemented with 100 M-g/ml ampicUlin salt, 
0.2% w/v glucose, 1 mM IPTG, and 0.0167% w/v arabinose. 
Cultures were incubated for 12-14 hours at 37°C and 250 rpm 
and plasmids were isolated as above. Plasmids (2 (ig) were 
linearized with IX NcoI-HF (20 Units/ug DNA) in IX CutSmart 
Buffer. Linear plasmids were purified using DNA Clean & 
Concentrator-5 (Zymo Research, Irvine, CA). Linearized plasmids 
(500 ng) were treated with bisulfite reagent using the EZ-DNA 
Methylation Gold Kit (Zymo Research, Irvine, CA). Touchdown 
PGR, using PfuTurbo Cx Hotstart DNA polymerase was used to 
amplify regions encoding the target and the non-target sites and 
was modified from [34]. An initial cycle of 95°C for 3 min was 
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Figure 4. Bisulfite analysis of the frequency of methylation at target, non-target, and other proximal CpG sites. Percent methylation 
observed at individual CpG sites at and adjacent to the (A) target site and (B) non-target site. Percentages at each CpG site were determined by 
bisulfite sequencing of n number of clones (n indicated at right). CpG sites are numbered sequentially from 1-48 or 1-60 based on their order in the 
sequencing read and thus, the figure does not indicate the distance between sites. The asterisks indicate two CpG sites at which the methylation 
status of one clone was judged to be indeterminate due to poor sequencing quality in this region. Black, 'WT' heterodimeric enzyme (KFNSE); orange, 
PFCSY variant; blue, CFESY variant. The target and non-target CpG sites (i.e. the two sites assessed by restriction enzyme digests) are indicated by the 
arrows. The length of DNA subjected to bisulfite sequencing was 548 bp for the target site and 631 bp for the non-target site. 
doi:10.1371/journal.pone.0096931.g004 



followed by a touchdown PGR (95°C for 1 min, annealing 
temperature for 1 minute, 72°C for 2 minute). The annealing 
temperature started at 64°C and was dropped 2°C degrees after 
two cycles and then decreased 1 °C after every other cycle until the 
annealing temperature reached 52°C. After the touchdown PGR, 
an additional 30 cycles were carried out with the parameters above 
and an annealing temperature of 51°C. A final extension was 
carried out at 72°C for 10 min. The antisense strand at the target 
site was amplified with primers 5'-AAG AC A GAG GTG AAA 
CTA AAT AAG GTT GGG CAT TAT AAT TGT TCT'(Fw) 
and 5'-CCG TAG CCA TGG TAT ATT TTT AAT AAA TTT 
TTT AGG GAA ATA GGT TAG GTT TTT AT-3' (Rev). The 
antisense strand at the non-target site was amplified with primers 
5'-AAG ACA GAG CTC CTC TAG TAA TCC TAT TAG 
CAA TAA CTA CTA CCA ATA A-3'(Fw) and 5'-CCG TAG 
CCA TGG GTA AAG TTT GGG GTG TTT AAT GAG TGA 
GTT AAT TTA TAT TAA TTG-3' (Rev). PGR amplified 
products were purified by gel electrophoresis as above digested 
with SacTHF and NcoTHF, ligated into pDIMN9 and trans- 
formed into NEB 5-alpha Competent E. coli (High Efficiency). 
Individual colonies were sequenced and analyzed using quantifi- 
cation tool for methylation analysis (QUMA) [35]. Low quality 
sequences were excluded if they had more than five unconverted 
CpH sites or if less than 95% of all CpH sites were converted. 
Sequences were also excluded if they either had over 10 ahgnment 



mismatches or less than 90% percent identity to the reference 
sequence. 

Results and Discussion 

Design of the Selection System 

Our in vitro selection system preferentially enriches variants from 
a mutagenesis library that possess the ability to methylate a target 
site, but also lack the ability to methylate other non-targeted 
M.SssI sites on the plasmid. In vitro selection strategies have been 
used to enrich for methyltransferases with relaxed or altered 
specificity. Most strategies rely on methylation-dependent protec- 
tion from restriction endonuclease digestion to positively select for 
DNA encoding a methyltransferase with altered specificity [36— 
40]. Our selection scheme differs from previous studies as it 
additionally employs McrBG as a negative selection against 
unwanted methylation activity. In our system for altering 
methyltransferase specificity, a single plasmid contains both genes 
encoding the zinc finger-fused M.SssI fragments as well as a 
targeted M.SssI CpG site that is nested within an Fspl restriction 
site and flanked by zinc finger binding sequences (Fig. 1 A-C). The 
plasmid also has over 400 other M.SssI (i.e. CpG) sites and a non- 
target site, comprised of a SnaBI restriction site, for the assessment 
of off-target methylation (Fig. ID). Once transformed into E. coli, 
the methyltransferase fragments encoded by the plasmid are 
expressed, resulting in methylation of the same plasmid. The 
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Figure 5. Substitution of new zinc fingers targets metKiyiation 
towards a new site. (A) A schematic of the designed methyltrans- 
ferase is shown assembled over the new, targeted CpG site. New 
cognate zinc finger recognition sequences flank a CpG site nested 
within an Fspl site. Zinc fingers CD54-310pt and CD54a have replaced 
the HS1 and HS2 zinc fingers. (B) The non-target site contains the HS1 
and HS2 zinc finger recognition sites flanking a CpG site nested within a 
Fspl restriction site (i.e. this was the target site in experiments in 
Figure 2). (C) The relative locations of the target site and non-target site 
are shown on a plasmid linearized by Ncol digestion. (D) The restriction 
endonuclease protection assay for methylation at the target and non- 
target site for the 'wildtype' heterodimeric enzyme (KFNSE) and two 
selected variants with mutations in the region 297-301. 
doi:1 0.1 371 /journal.pone.0096931 .g005 

plasmid DNA is isolated and subjected to in vitro digestions with 
endonucleases Fspl and McrBC (Fig. IE). Since Fspl digestion is 
blocked by methylation, Fspl digestion serves to select for 
methylation at the targeted CpG site. McrBC is an endonuclease 
that recognizes and cleaves DNA with two distally methylated sites 
[41,42]. McrBC will not digest a single site that is methylated or 
hemimethylated unless there is a second methylated site on the 
same DNA within about 40-3000 bp [43]. We therefore expect 
that most plasmids methylated at multiple M.SssI sites wiU be 
digested by McrBC. Thus, McrBC digestion selects against o£F- 
target methylation. The DNA is then incubated with ExoIII to 
degrade any plasmid that is digested at least once, ideally leaving 
the plasmid DNA encoding a highly specific methyltransferase 
intact for the subsequent transformation. 

Initial proof of principal selections demonstrated that McrBC, 
Fspl and ExoIII treatment of unmethylated plasmid DNA, 



followed by transformation resulted in a 99.85% decrease in the 
number of transformants relative to untreated DNA. Similarly, 
McrBC, Fspl and ExoIII treatment of a highly methylated plasmid 
reduced transformants by 99.95% relative to untreated control. 

Design of the Library 

We constructed a library of M.SssI C-terminal fragment 
variants randomized at residues 297-301 (Fig. IB). We hypothe- 
sized that mutations to these residues might reduce the ability of 
the split methyltransferase to methylate non-targeted CpG sites by 
reducing the fragment's inherent affinity for double-stranded 
DNA. Early studies indicated that M.SssI interacts with DNA, 
irrespective of the presence of CpG sites and subsequentiy 
methylates processively [44]. Further, a homology model of 
M.SssI suggested that residues 297 and 299 form contacts with the 
ribose phosphate backbone on the CpG bases complementary to 
the methylated CpG site [45] . Mutational studies showed that for 
monomeric M.SssI, K297A or N299A mutations did not 
appreciably affect either the catalytic activity or the dissociation 
constant of a CpG containing oligonucleotide [46] . Mutating these 
residues, we hypothesized, might eliminate the innate afiinity of 
our fragments for DNA without affecting the catalytic activity of 
the enzyme. 

Additionally, the homology model indicated the amide back- 
bone of serine residue at position 300 made base-specific contacts 
with the cytosine and guanine bases complementary to the 
methylated strand. This model initially implicated serine's 
conserved and catalyticaUy important role for stabilizing the 
complementary strand during base flipping and methylation [45] . 
However, the S300P mutation resulted in only a three-fold 
increase in a dissociation constant and no significant change in 
initial rate of reaction [47] . 

Library Selections 

Initial selection experiments on this library resulted primarily in 
the isolation of plasmid DNA with a deleted Fspl restriction site, 
presumably formed by a recombination event. This false positive 
was a trivial, albeit frequently observed, solution for plasmid 
survival in our devised scheme. Thus, we subjected the plasmid 
DNA from the resulting transformants to additional steps to enrich 
for those plasmids that survived our selection and retained their 
Fspl site. In these additional steps, the plasmid DNA was 
transformed into ER2267 cells and the cells were plated under 
conditions known to repress the promoters controlling methyl- 
transferase fragment expression. We digested plasmid DNA from 
these cells with Fspl and purified the linear, Fspl-digested DNA 
away from undigested plasmid DNA by agarose gel electropho- 
resis. The portion of the plasmid encoding the zinc fingers and 
methyltransferase genes was PGR amplified, ligated back into the 
same plasmid backbone, and subjected to an additional rovmd of 
selection ('recombinant removal' step in Fig. IE). The additional 
round of selection also included this Fspl site-enrichment step. 
Variants were then randomly selected for further analysis. 

Analysis of Library Variants Identified by Selections 

We assayed 47 variants identified from our selections for 
methylation activity at both the target and non-target site using 
our restriction digest assay and determined the variants' sequenc- 
es. Representative variants from these digest assays are shown in 
Figure 2B. Most active variants qualitatively displayed biased 
methyltransferase activity toward the targeted site. A complete list 
of sequenced variants can be found in Table S 1 . 

We compiled a list of amino acid sequences from all the active 
variants that were assayed in our digestion assays to create a 
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sequence logo using using weblogo 3.3 [48,49]. This sequence logo 
indicated that a functional heterodimeric methyltransferase 
strongly preferred certain residues at positions 298 and 300 
(Fig. 3). Position 298 (vvildt)'pe phenylalanine) was almost 
exclusively composed of aromatic residues. Position 300 (wildtype 
serine) was almost exclusively composed of small residues (defined 
as an amino acid with an R side chain containing 1-3 heavy 
atoms). The observed conservation at these residues is consistent 
with sequence alignments showing these two residues are relatively 
well-conserved among methyltransferases of different species [45] . 
In contrast, positions 297, 299 and 301 exhibited littie preference 
for specific amino acids. This finding Ls consistent with the 
mutational study discussed above [46]. Our study reveals that 
there are numerous solutions for improving the specificity of our 
zinc finger-fused, bifurcated methyltransferases. 

To further characterize our engineered methyltransferases, we 
subjected plasmids containing optimized variants, PFCSY, 
CFESY (named for the sequence at residues 297-301), and the 
un-optimized 'WT' variant to bisulfite analysis at both the target 
and non-target sites. These plasmids were isolated from cultures 
grown under conditions known to induce the expression of the 
plasmids' methyltransferase fragment fusion genes. In addition to 
asses.sing levels of methylation at the target and non-target CpG 
sites, the regions subjected to bisulfite sequencing assessed the 
methylation status of 47 and 59 additional CpG sites around the 
target and non- target sites, respectively (covering over 25% of the 
total CpG sites present on the plasmid). We sequenced S15 clones 
for each variant to quantify the frequency of methylation at all 
CpG sequences around both sites (Fig. 4A,B). Based on this 
sequencing, the PFCSY variant methylated the target site at a 
frequency of 78.9%. In contrast, only fifteen olf-target methylation 
events were observed in the 34 sequence reads (out of a total of 
1793 possible oflF-target methylation events), which corresponds to 
an oflF-target methylation frequency of 0.84%. The PFCSY 
variant's specificity for the target site is a marked improvement 
over the un-optimized, 'WT' variant, which methylated the target 
site at a frequency of 94. 1 % and olf-target sites at a frequency of 
49.5%. Thus, for this variant, our selections resulted in the 
identification of a variant with an almost 60-fold reduction in off- 
target methylation and a minimal decrease in methylation at the 
target site. The CFESY variant methylated the target site at a 
lower frequency compared to the PFCSY variant, but exhibited a 
similar low frequency of methylation at other CpG sites (target 
frequency of 42.1% and a 0.71% frequency at all other CpG sites). 

The Targeted Heterodimeric Methyltransferases are 
Modular 

To test whether our targeted M.SssI methyltransferases are 
modular with respect to the zinc finger domains, we replaced zinc 
fingers HSl and HS2 with two zinc fingers designed to target a 
specific site in the promoter of intercellular adhesion molecule 1 

(ICAMl). The previously designed zinc finger CD54-310pt [50] 
is adjacent to a CpG site in this promoter. To generate a pair of 
zinc fingers capable of flanking this CpG site, we designed a 
second zinc finger, CD54a, to bind downstream from the 
recognition sequence of CD54-310pt and adjacent CpG site 
(Fig. 5A). The two zinc fingers were fused to fragments comprising 
un-optimized bifurcated M.SssI fragments (residues KFNSE at 
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positions 297-301) and to two selected variants (CFESY and 
SYSSS at positions 297—301), replacing the HSl and HS2 zinc 
fingers (Fig. 5 A). These two optimized variants (CFESY and 

SYSSS) were chosen because preliminary experiments (preformed 
essentially as described in [17]) suggested that methylation at the 
target site (containing both zinc finger binding sites) was greater 
than the additive amount of methylation levels observed at "half- 
sites" composed of only one or the other of the zinc finger binding 
sequences. 

We assessed the methyltransferase activity and specificity of 
these constructs in E. coli with a restriction endonuclease 
protection assay at the target and non-target sites (Fig. 5A-D). 
Notably, the 'non-target' site assessed in this experiment contained 
the zinc finger sequences recognized by HS 1 and HS2 zinc fingers 
(compare Fig. 5B and IC). Although all three constructs 
methylated the target site derived from the ICAMl promoter, 
the CFESY and SYSSS constructs targeted methylation to the 
desired site with littie to no observable methylation at the non- 
target site (Fig. 5D). 

The CD54-310pt was chosen because it was shown to 
effectively target the ICAMl promoter, altering transcription 
levels when fused to transcriptional activators or repressors 
[50,51]. AdditionaUy, fusion of CD54-310pt to Ten-Eleven 
Translocation 2 enzyme resulted in a smaU, observable amount 
of demethylation around the target site, correlating with a 2-fold 
upregulation in ICAMl transtxiption [52]. Our construct may 
potentially enable assessment of the biological affects of targeted 
methylation at this site. 

Supporting Information 

Figure SI The DNA and amino acid sequences for the 
(A) N-terminal and (B) C-terminal M.SssI fragments 
fused to CD54-310pt and CD54a respectively. The 

methyltransferase fragments (cyan), amino acid linkers (yeUow), 
and zinc finger domains (red) are shown along with the 'wildtype' 
sequence from 297-301 (KFNSE) shown in magenta. 
(PDF) 

Table SI Variants from the selected library. Sequenced 
library variants are shown. Aromatic amino acids at position 298 

are highlighted in yellow and small amino acids (defined as an 
amino acid with an R side chain containing 1-3 heavy atoms) at 
position 300 are highlighted in cyan. Stop codons are denoted by a 
"Assayed" column has an "x" if the variant was tested in the 
restriction endonuclease protection assay at the target and non- 
target site. "Active" column has an "x" if the assay indicated 
protection from restriction enzyme digestion at one or both sites. 
(PDF) 
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